FusionRCNN: LiDAR-Camera Fusion for Two-Stage 3D Object Detection

نویسندگان

چکیده

Accurate and reliable perception systems are essential for autonomous driving robotics. To achieve this, 3D object detection with multi-sensors is necessary. Existing detectors have significantly improved accuracy by adopting a two-stage paradigm that relies solely on LiDAR point clouds proposal refinement. However, the sparsity of clouds, particularly faraway points, makes it difficult LiDAR-only refinement module to recognize locate objects accurately. address this issue, we propose novel multi-modality approach called FusionRCNN. This effectively efficiently fuses camera images in Regions Interest (RoI). The FusionRCNN adaptively integrates both sparse geometry information from dense texture unified attention mechanism. Specifically, first utilizes RoIPooling obtain an image set size gets sampling raw points within proposals RoI extraction step. Then, leverages intra-modality self-attention enhance domain-specific features, followed well-designed cross-attention fuse two modalities. fundamentally plug-and-play supports different one-stage methods almost no architectural changes. Extensive experiments KITTI Waymo benchmarks demonstrate our method boosts performances popular detectors. Remarkably, improves strong SECOND baseline 6.14% mAP outperforms competing approaches.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fusion of 3D-LIDAR and camera data for scene parsing

Fusion of information gathered from multiple sources is essential to build a comprehensive situation picture for autonomous ground vehicles. In this paper, an approach which performs scene parsing and data fusion for a 3DLIDAR scanner (Velodyne HDL-64E) and a video camera is described. First of all, a geometry segmentation algorithm is proposed for detection of obstacles and ground areas from d...

متن کامل

Robust Curb Detection with Fusion of 3D-Lidar and Camera Data

Curb detection is an essential component of Autonomous Land Vehicles (ALV), especially important for safe driving in urban environments. In this paper, we propose a fusion-based curb detection method through exploiting 3D-Lidar and camera data. More specifically, we first fuse the sparse 3D-Lidar points and high-resolution camera images together to recover a dense depth image of the captured sc...

متن کامل

A 3d Time of Flight Camera for Object Detection

The knowledge of three-dimensional data is essential for many control and navigation applications. Especially in the industrial and automotive environment a fast and reliable acquisition of 3D data has become a main requirement for future developments. Moreover low cost 3D imaging has the potential to open a wide field of additional applications and solutions in markets like consumer electronic...

متن کامل

LIDAR-based 3D Object Perception

This paper describes a LIDAR-based perception system for ground robot mobility, consisting of 3D object detection, classification and tracking. The presented system was demonstrated on-board our autonomous ground vehicle MuCAR-3, enabling it to safely navigate in urban traffic-like scenarios as well as in off-road convoy scenarios. The efficiency of our approach stems from the unique combinatio...

متن کامل

LiDAR-Camera Calibration using 3D-3D Point correspondences

With the advent of autonomous vehicles, LiDAR and cameras have become an indispensable combination of sensors. They both provide rich and complementary data which can be used by various algorithms and machine learning to sense and make vital inferences about the surroundings. We propose a novel pipeline and experimental setup to find accurate rigid-body transformation for extrinsically calibrat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Remote Sensing

سال: 2023

ISSN: ['2315-4632', '2315-4675']

DOI: https://doi.org/10.3390/rs15071839